Automatic classification of high resolution land cover using a new data weighting procedure: The combination of k-means clustering algorithm and central tendency measures (KMC-CTM)
نویسنده
چکیده
Information on a well-scale urban land cover is important for a number of urban planning practices involving tree shade mapping, green space analysis, urban hydrologic modeling and urban land use mapping. In this study, an urban land cover dataset received from the database of UCI (University of California at Irvine) machine learning was used as the urban land cover data. This dataset is the urban area located in Deerfield Beach, FL, USA. Separately, this dataset is a high definition atmospheric image consisting of 9 different urban land covers. The characteristics of a multi-scale spectral, magnitude and formal tectology were used to sort out and classify these different images. The dataset comprises a total of 147 features and land covers of 9 different areas involving trees, grass, soil, concrete, asphalt, buildings, cars, pools and shadows. A new data weighting method was recommended to classify these 9 different patterns automatically. This recommended data weighting method is based on the combination of the measures of central tendency composed of mean value, harmonic value, mode and median along with the k-means clustering method. In the data weighting method, the data sets belonging to each class within the dataset are first calculated by using k-means clustering method, after which the measures of central tendency belonging to each class are calculated, as well. The measure of central tendency belonging to each class is divided by the set central value belonging to the class in question, as the result of which the data weight coefficient of that class has already been calculated. This calculation process is performed separately for 9 different land covers, and afterwards, these data weighting coefficients found are multiplied by the dataset, and thus, the dataset has been weighted. In the second stage, on the other hand, 3 different classification algorithms containing k-NN (k-nearest neighbor), extreme learning machine (ELM) and support vector machine (SVM) were used to classify 9 different urban land covers after the data weighting method. In determining the educational and test data sets, the 10-fold cross validation was used. When classification through raw data was performed along with k-NN (for k = 1), ELM and SVM classification algorithms, the overall classification accuracy obtained was 77.15%, 84.70% and 84.79%, respectively. When classification through data weighting method (the combination of k-means clustering and mode measure) along with k-NN (for k = 1), ELM and SVM classification algorithms was made, the overall classification accuracy obtained proved to be 98.58%, 98.62% and 98.77%, respectively. The obtained results suggest that the urban land cover in an atmospheric image via the recommended data weighting method was classified as 9 different areas with a high classification success rate. © 2015 Elsevier B.V. All rights reserved.
منابع مشابه
Automatic Interpretation of UltraCam Imagery by Combination of Support Vector Machine and Knowledge-based Systems
With the development of digital sensors, an increasing number of high-resolution images are available. Interpretation of these images is not possible manually, which necessitates seeking for practical, fast and automatic solutions to solve the environmental and location-based management problems. The land cover classification using high-resolution imagery is a difficult process because of the c...
متن کاملPalarimetric Synthetic Aperture Radar Image Classification using Bag of Visual Words Algorithm
Land cover is defined as the physical material of the surface of the earth, including different vegetation covers, bare soil, water surface, various urban areas, etc. Land cover and its changes are very important and influential on the Earth and life of living organisms, especially human beings. Land cover change monitoring is important for protecting the ecosystem, forests, farmland, open spac...
متن کاملDevelopment of an Automatic Land Use Extraction System in Urban Areas using VHR Aerial Imagery and GIS Vector Data
Lack of detailed land use (LU) information and efficient data collection methods have made the modeling of urban systems difficult. This study aims to develop a novel hierarchical rule-based LU extraction framework using geographic vector and remotely sensed (RS) data, in order to extract detailed subzonal LU information, residential LU in this study. The LU extraction system is developed to ex...
متن کاملA Hybrid Data Clustering Algorithm Using Modified Krill Herd Algorithm and K-MEANS
Data clustering is the process of partitioning a set of data objects into meaning clusters or groups. Due to the vast usage of clustering algorithms in many fields, a lot of research is still going on to find the best and efficient clustering algorithm. K-means is simple and easy to implement, but it suffers from initialization of cluster center and hence trapped in local optimum. In this paper...
متن کاملWeighted Ensemble Clustering for Increasing the Accuracy of the Final Clustering
Clustering algorithms are highly dependent on different factors such as the number of clusters, the specific clustering algorithm, and the used distance measure. Inspired from ensemble classification, one approach to reduce the effect of these factors on the final clustering is ensemble clustering. Since weighting the base classifiers has been a successful idea in ensemble classification, in th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Appl. Soft Comput.
دوره 35 شماره
صفحات -
تاریخ انتشار 2015